Run on: 



July 6, 2004, 13:57:03 ; Search time 52.508 Seconds 

(without alignments) 
1382.059 Million cell updates/sec 



Title: 

Perfect score 
Sequence: 



US-10-063-588-80 
1174 

1 MASLGLQLVGYI LGLLGLLG 



PGQPPKVKSEFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched 



1017041 seqs, 315518202 residues 



Total number of hits satisfying chosen parameters: 1017041 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : SPTREMBL 25:* 



1: 




sp_archea : * 


2: 




spjbacteria: * 


3: 




sp_fungi : * 


4 : 




sp_human : * 


5: 




sp_invertebrate : * 


6. 




sp_mammal : * 


1 




sp_mhc : * 


8 




sp_organelle: * 


9 




sp_phage : * 


10: 


sp_ plant : * 


11: 


sp_rodent : * 


12: 


sp_virus : * 


13: 


sp_vertebrate : * 


14- 


sp_unclassif ied: * 


15 


sp_rvirus : * 


16 


sp_bacter iap : * 


17 


sp_archeap : * 



SUMMARIES 

% 



Result 
No. 


Score 


Query 

Match Length DB 


ID 


• 

Description 


1 


456 


38.8 


218 


13 


Q90XR2 


Q90xr2 brachydanio 


2 


452 


38.5 


213 


13 


Q7ZSZ2 


Q7zsz2 xenopus lae 


3 


448 .5 


38.2 


219 


11 


Q810I9 


Q810i9 mus musculu 


4 


448 


38.2 


213 


13 


Q805G0 


Q805g0 xenopus lae 


5 


446 


38.0 


215 


13 


Q90XR8 


Q90xr8 brachydanio 


6 


444 .5 


37.9 


209 


13 


Q90XR9 


Q90xr9 brachydanio 


7 


438 


37.3 


210 


13 


Q90XQ8 


Q90xq8 brachydanio 


8 


432 .5 


36.8 


214 


13 


Q90XQ9 


Q90xq9 xenopus lae 


9 


430 


36.6 


214 


13 


Q9DE12 


Q9del2 xenopus lae 


10 


428 .5 


36.5 


214 


13 


Q98SR2 


Q98sr2 gallus gall 


11 


421 


35.9 


214 


13 


Q90XR0 


Q90xr0 brachydanio 



12 


419 


35. 


7 


13 


417.5 


35. 


6 


14 


413.5 


35. 


2 


15 


399.5 


34. 


0 


16 


396 


33. 


7 



214 13 Q7T2P4 

210 13 Q90XR4 
206 13 Q90XS1 

211 4 Q7Z4X9 
211 



Q7t2p4 brachydanio 
Q90xr4 brachydanio 
Q90xsl brachydanio 
Q7z4x9 homo sapien 



Run on: 



July 6, 2004, 13:58:13 ; Search time 19.9678 Seconds 

(without alignments) 
1107.984 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-063-588-80 
1174 

1 MASLGLQLVGYI LGLLGLLG 



PGQPPKVKSEFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0 . 5 

Searched: 283366 seqs, 96191526 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



PIRJ78:* 

1: pirl:* 

2 : pir2 : * 

3: pir3:* 

4: pir4:* 



283366 



SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


423.5 


36.1 


280 


2 


A39484 


androgen - wi thdrawa 


2 


100 


8.5 


400 


2 


AG2866 


MFS permease [drug 


3 


100. 


8.5 


400 


2 


D97643 


probable efflux pr 


4 


99.5 


8.5 


330 


2 


C69648 


2 -keto-3 -deoxygluc 


5 


99.5 


8.5 


463 


2 


F90285 


metabolite transpo 


6 


97 .5 


8.3 


484 


2 


C75609 


amino acid ABC tra 


7 


95.5 


8.1 


458 


2 


H71657 


NADH2 dehydrogenas 


8 


93 


7.9 


523 


2 


T11916 


NADH2 dehydrogenas 


9 


91 


7.8 


160 


2 


JN0503 


peripheral myelin 


10 


90.5 


7.7 


487 


2 


T19237 


hypothetical prote 


11 


90.5 


7.7 


617 


2 


F71019 


hypothetical prote 


12 


88.5 


7.5 


429 


2 


A97241 


permease [imported 


13 


88.5 


7.5 


521 


2 


A99549 


amino acid permeas 


14 


88 .5 


7.5 


593 


2 


H71283 


conserved hypothet 


15 


88 


7.5 


160 


2 


A41144 


growth arrest -rela 



Run on: 



July 6, 2004, 13:56:23 ; Search time 14.791 Seconds 

(without alignments) 
809.691 Million cell updates/sec 



Title: 

Perfect score 
Sequence: 



US-10-063-588-80 
1174 

1 MASLGLQLVGYI LGLLGLLG 



PGQPPKVKSEFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 141681 seqs, 52070155 residues 

Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



141681 



Database : 



SwissProt 42:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 



Result 
No. 


Score 


Query 

Match Length 


DB 


ID 


Description 


1 


1174 


100. 0 


230 


1 


CLD2 HUMAN 


P57739 homo sapien 


2 


1119 


95.3 


230 


1 


CLD2 CANFA 


Q95km6 canis famil 


3 


1096 


93.4 


230 


1 


CLD2 MOUSE 


088552 mus musculu 


4 


571 


48.6 


239 


1 


CLDE HUMAN 


095500 homo sapien 


5 


561.5 


47. 8 


239 


1 


CLDE_MOUSE 


Q9z0s3 mus musculu 


6 


459.5 


39.1 


208 


1 


CLDY_J3RARE 


Q9yh91 brachydanio 


7 


450.5 


38.4 


218 


1 


CLD3_CANFA 


Q95km5 canis famil 


8 


450 


38.3 


220 


1 


CLD3_HUMAN 


015551 homo sapien 


9 


446 


38.0 


210 


1 


CLD4_M0USE 


035054 mus musculu 


10 


443.5 


37.8 


209 


1 


CLD4_CERAE 


019005 cercopithec 



4 



XX 
PS 
XX 

cc 
cc 
cc 
cc 

XX 
SQ 



Cla/im 1;1 Page 12 (Disclosure 



present sequence 
suppressing activity/ The pr 
tme preparation of the prot 
the protein in treating diseases su 



epresehts human /SP82 protein, whiifch has can 
ssent invention also describes a meth 
by repombinati sn, and tp^ applicat 
as cancer 



jquence E30 AA; 



Queijy Match 1 / 100.0%; 

Best] Local similarity 100.0%; 
Matqhes 230 ;l Conservative 



61 II QCDI Y 3TLLGLPADI QA IQ. 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

Searci completed 
Job ttme : 76.2154 



ILGLQIJVGYI LGLLGLL 

! i i 1 1 J 1 1 1 1 i 1 1 1 ) 1 1 

ILGLQLVG Y I LGLLGLL 



61 I TQCD I Y STLLGLPAD I QA \Q 



181 LCF 



; 42pp; Ch/nese\ 



Scorfe 1174; DB 5; Lghgth\230; 
Prefl. No. 7.3e-L17; 

fismatches 10; Ridels \ 0; /Gaps 



GTLVAMLLPSWKTSS YVGAS I 

lllllllllllllfll 

PSWKTSSYVGkS: 



!A VGF S KGLWME 

Illllllll 

n AVGFS KGtiWME 



'SSAI SSLACI I $ VyGMRCTVFCQlESRAKDRVAVA 0.2 0 



121 GG VFF I L 3GLLGF I P VAWN 



SSAI SSLACI I SX VGMRCTVFCQHSHAKDRVAVA 12 0 



121 GGvFF I L 3GLLGF I PVAWNlHC I LRDF YS PLVPDSM 



181 LCHSCSS 3RNRSNYYDAYQAQI 'LATRSSPRPGQPP 




y 6, 2004, ItH 
cs 



HQ I LRDFYS PLVPDSMKFu IIGEALYLGI ISELFSLIAGII ll8 0 



LFSLIAGII 180 



IGEALYLGI IS 



EFNSYSLTGYV 




0; 



THSTGA 60 



THSTG\60 



RNRSNYYDAYQAQI 'LATRSSPRPGQPP 



EFNSYSLTGYV 



230 



230 



00:34 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd 



OM protein - protein search, using sw model 



Run on: 



July 6, 2004, 13:58:48 ; Search time 63.6013 Seconds 

(without alignments) 
1125.689 Million cell updates/sec 



Title: 

Perfect score 
Sequence: 



US-10-063-588-80 
1174 

1 MASLGLQLVGYI LGLLGLLG 



PGQPPKVKS EFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1276540 seqs, 311283816 residues 



Total number of hits satisfying chosen parameters 



1276540 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA: * 

1 : /cgn2_6/ptodata/2/pubpaa/US07_PUBCOMB . pep : * 

2 : /cgn2_6/ptodata/2/pubpaa/PCTJtfEW_PUB .pep : * 

3 : /cgn2_6/ptodata/2/pubpaa/US06_NEW_PUB . pep : * 
4 : /cgn2_6/ptodata/2/pubpaa/US06_PUBCOMB.pep: * 
5 : /cgn2_6/ptodata/2/pubpaa/US07_NEW_PUB.pep: * 
6 : /cgn2_6/ptodata/2/pubpaa/PCTUS_PUBCOMB .pep : * 
7: /cgn2_6/ptodata/2/pubpaa/US08_NEW_PUB.pep:* 
8 : /cgn2__6/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 
9 : /cgn2_6/ptodata/2/pubpaa/US09A__PUBCOMB .pep: * 
10: /cgn2_6/ptodata/2/pubpaa/US09B_PUBCOMB.pep: * 
11: /cgn2_6/ptodata/2/pubpaa/US09C_PUBCOMB.pep: * 
12 : /cgn2_6/ptodata/2/pubpaa/US09JtfEW_PUB .pep: * 
13 : /cgn2_6/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 
14: / cgn2_6 /p t oda t a / 2 /pubpaa / US 1 0 B_PUBCOMB . pep : * 
15: /cgn2_6/ptodata/2 /pubpaa /US 10C_PUBCOMB. pep: * 
16 : /cgn2_6/ptodata/2/pubpaa/US10_NEW_PUB .pep : * 
17 : /cgn2_6/ptodata/2/pubpaa/US60_NEW_PUB .pep: * 
18 : /cgn2_6/ptodata/2/pubpaa/US60_PUBC0MB .pep : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1174 


100. 


0 


230 


9 


US- 


09- 


998- 


598-2590 


Sequence 2590, Ap 


2 


1174 


100. 


0 


230 


10 


US 


-09 


-888 


-257A-6 


Sequence 


6, Appli 


3 


1174 


100. 


0 


230 


10 


US 


-09 


-946 


-374-134 


Sequence 


134, App 


4 


1174 


100. 


0 


230 


11 


US 


-09 


-833 


-245-1256 


Sequence 


1256, Ap 


5 


1174 


100. 


0 


230 


12 


US 


-10 


-147 


-493-492 


Sequence 


492, App 


6 


1174 


100. 


0 


230 


12 


US 


-10 


-145 


-127-492 


Sequence 


492, App 


7 


1174 


100. 


0 


230 


12 


US 


-10 


-160 


-503-492 


Sequence 


492, App 


8 


1174 


100. 


0 


230 


12 


US 


-10 


-143 


-118-492 


Sequence 


492, App 


9 


1174 


100. 


0 


230 


12 


US 


-10 


-144 


-993-492 


Sequence 


4 92, App 


10 


1174 


100. 


0 


230 


12 


US 


-10 


-158 


-787-492 


Sequence 


4 92, App 


11 


1174 


100. 


0 


230 


12 


US 


-10 


-081 


-056-192 


Sequence 


192, App 


12 


1174 


100. 


0 


230 


12 


US 


-10 


-063 


-745-80 


Sequence 


80, Appl 


13 


1174 


100. 


0 


230 


12 


US 


-10 


-140 


-024-492 


Sequence 


492, App 


14 


1174 


100. 


0 


230 


12 


US 


-10 


-063 


-512-80 


Sequence 


80, Appl 


15 


1174 


100. 


0 


230 


12 


us 


-10 


-063 


-513-80 


Sequence 


80, Appl 


16 


1174 


100. 


0 


230 


12 


us 


-10 


-063 


-515-80 


Sequence 


80, Appl 


17 


1174 


100. 


0 


230 


12 


us 


-10 


-063 


-549-80 


Sequence 


80, Appl 


18 


1174 


100. 


0 


230 


12 


us 


-10 


-063 


-569-80 


Sequence 


80, Appl 


19 


1174 


100. 


0 


230 


12 


us 


-10 


-063 


-551-80 


Sequence 


80, Appl 


20 


1174 


100. 


0 


230 


12 


us 


-10 


-140 


-808-492 


Sequence 


492, App 


21 


1174 


100. 


0 


230 


12 


us 


-10 


-006 


-485A-134 


Sequence 


134, App 


22 


1174 


100. 


0 


230 


12 


us 


-10 


-013 


-907A-134 


Sequence 


134, App 


23 


1174 


100. 


0 


230 


12 


us 


-10 


-015 


-499A-134 


Sequence 


134, App 



Run on: 



July 6, 2004, 13:55:23 ; Search time 73.2154 Seconds 

(without alignments) 
887.599 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-063-588-80 
1174 

1 MASLGLQLVGYI LGLLGLLG 



PGQPPKVKSEFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1586107 seqs, 282547505 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



1586107 



Database : A_Geneseq_29Jan04 : * 

1: geneseqpl980s : * 

2: geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6 : geneseqp2003as : * 

7 : geneseqp2 003bs : * 

8: geneseqp2004s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1174 


100. 


0 


230 


2 


AAY36181 


Aay36181 


Human sec 


2 


1174 


100. 


0 


23.0 


3 


AAY84609 


Aay84609 


A human m 


3 


1174 


100. 


0 


230 


3 


AAY99378 


Aay99378 


Human PRO 


4 


1174 


100. 


0 


230 


4 


AAB66127 


Aab66127 


Protein o 


5 


1174 


100. 


0 


230 


4 


AAE04207 


Aae04207 


Human gen 


6 


1174 


100. 


0 


230 


4 


AAU12417 


Aaul2417 


Human PRO 


7 


1174 


100. 


0 


230 


4 


AAU09178 


Aau09178 


Human PRO 


8 


1174 


100. 


0 


230 


4 


AAB87565 


Aab87565 


Human PRO 


9 


1174 


100. 


0 


230 


4 


AAB88342 


Aab88342 


Human mem 


10 


1174 


100. 


0 


230 


5 


ABP67991 


Abp67991 


Human col 


11 


1174 


100. 


0 


230 


5 


ABG95890 


Abg95890 


Human sec 


12 


1174 


100. 


0 


230 


5 


ABB84912 


Abb84 912 


Human PRO 


13 


1174 


100. 


0 


230 


5 


AAU76534 


Aau76534 


Tumour -as 


14 


1174 


100. 


0 


230 


5 


ABG64507 


Abg64507 


Human alb 


15 


1174 


100. 


0 


230 


5 


ABB04707 


Abb04707 


Human SP8 


16 


1174 


100. 


0 


230 


5 


ABB95518 


Abb95518 


Human ang 



1 H 


1 1 HA 
XX 1 *± 


i fifi 
x u u . 


fi 

u 


9 9 fi 
ZO u 


o 


HDUl / O D X 


i\±J\J X / O O X 


IMUVcI 


X1UIU 


1 Q 


1 1 94 
XX l*± 


1UU . 


fi 
u 


9 9 fi 
ZjU 


o 


7VDTTQ1 lie: 
nOUO X X 1j 


rUJUO J.11J 


n Litiictii 


ppo 


1 Q 
Xzf 


1 1 1 A 
XX i t 


i no 

1UU . 


fi 


9 7 fi 


(Z 
D 


ZIRTTQfiQI R 


au u. y U7ij 


INUVCl 


Vrnnn 

I1L11U 


9 n 
z u 


1 1 94 
XX /rt 


i fin 


u 


9 "3 fi 
Z J U 


O 


ARH9 9 Q94 


nJJU J J J / *± 


Hi ima ti 
n Liuidii 




9 1 
Z 1 


1 1 7/1 
XX I *± 


1 fifi 
1UU . 


fi 


Z J u 


O 


ARTT71 QQ1 
riDU / X «? X 


nhn71 QQ1 


LNU VCi 


Vinm 


9 9 
z z 


X X 1 ft 


XUU . 


u 


9 9 fi 
Z J u 




ART Tfifi A 1 R 




n ULUtCLli 


PPO 


9 "3 
Z O 


1 1 HA 
X X 1 *x 


i fifi 

1UU . 


fi 
u 


9 9fi 


o 


APJT91 ^4R 




n unlet i j. 


C3 op 


Zrr 


1 1 n a 

XX f *x 


1 fifi 


fi 
u 


9 9fi 


(Z 


ARR47Q4fi 

nDI\1 / j *± o 


Abr*47Q4fi 


flLlllLClll 


O CL 


ZD 


1 1 HA 
XX J*k 


x u u . 


u 


9 7 fi 
Zo u 






AJJUD 


in u vex 


O £3 f~* 

bet 


Zo 


1 1 HA 
XX /4 


i fi r» 


U 


9 "3 fi 
Zo U 


O 


APT T*7 9 9 9 fi 
.firSU / Z OZ o 


& Hit 9 9 9 9 fi 
nUU / ZO Z O 


nuiuan. 


rKU 


T H 
Z / 


1 1 HA 

XX /4 


1 fi fi 


U 


9 "J fi 
ZO U 


O 


am TQ fi QQ Q 


ahnQftQQ Q 

±\X)\xy U-75J 


riLiiuan. 


trr\XJ 


O Q 

Zo 


1 1 1 A 

XX /4 


1 fi fi 

X u u . 


U 


9 9 fi 
Z 0 u 


D 


aian9 9 9 9 fi 
AoUZ /oz u 


aVw~i9 9 9 9 fi 
JnXJKJZ 1 0 Z U 


XlUIUcill 


o a f~* 

sec 


z y 


1 1 HA 


inn 
X u u . 


u 


9 9 fi 
ZO U 


o 


/VDUZ jUOD 


ftVi^9RfiQfi 


flUlllclIl 


q a 


1 fi 
■D U 


1 1 HA 

XX 1 *t 


1 fi fi 

X u u . 


u 


9 9 fi 
ZO U 


c 

D 


AOKU U X / O 


rvUX U U X / O 


riuiudll 


gen 


J 1 


1 1 HA 
XX 1 *± 


X u u . 


fi 

u 


9 9 fi 
Z O U 


o 


ADI TQ O C 1 C 


i-yju JZ 9 13 


TJi i fti a n 
nulllldll 


OCL 


OZ 


1 1 HA 
XX 1 *± 


i r\r\ 


fi 

u 


9 9 fi 
Z O V 


c 
o 


ADT TQ 1 T DC 

t\auo xxod 


ft Wit A 1 1 A R 
rlX9UO XXOD 


riuiUciii 


C? £3 Z"" 1 


"3 "3 


1 1 HA 
XX l *± 


1 fifi 
IUU . 


fi 
u 


9 9 fi 
Z O U 


o 


-HX3VJDO Z j V 


rtJJU DOZ I? ^ 


IN tj V Cl 


Vii im 
1 J U1U 


041 


1 1 HA 
XX 1 *± 


1 fifi 


fi 
u 


99 fi 

z o u 


o 


aRTT^9 fiQI 


AKi i fi 7 fi Q 1 


n Liiiiciii 


OOP 
OCL 


-3D 


1 1 HA 
XX 1 *± 


x u u . 


fi 

u 


9 9 fi 
ZO U 


(Z 

o 


APT TOO 9 fi9 


Ahi Q A 9 fi9 
rUJU y o o u z 


IN L) V C X 


IlUIIl 


JD 


1 1 HA 
XX / *t 


i f\f\ 

IUU . 


fi 

u 


9 9 fi 
ZO U 


c 
o 


ApT TQ Q 9 fi'7 
ADUO J70U/ 


aV\n A Q9 fi9 
i-vlJLlO J70 U / 


WOVcl 


I1U.IU 


"J T 
J / 


1 1 HA 
XX I % 


XUU. 


A 

u 


9 9 fi 
ZO U 


c 
O 


APT TQOCI >1 
iVBUo Z D X4 


HDUO Z D X*t 


iNovex 


riuiu 


O Q 
JO 


1 1 H A 

XX /4 


X U U . 


u 


*5 7 fi 
Zo U 


b 


7V TJT TQC47Q 

AoUyo^ /o 


R Hi i Q fi 4 H A 


Human 


PRO 


39 


1174 


100 . 


0 


O **i f\ 

230 


6 


ABU72148 


Ax)u72148 


Human 


PRO 


40 


1174 


100. 


0 


230 


6 


ADA46011 


Ada46011 


Novel 


hum 


41 


1174 


100. 


0 


230 


6 


ADA76442 


Ada76442 


Human 


PRO 


42 


1174 


100. 


0 


230 


6 


ADB17137 


Adbl7137 


Human 


tra 


43 


1174 


100. 


0 


230 


6 


ADA19092 


Adal9092 


Human 


PRO 


44 


1174 


100. 


0 


230 


6 


ADA61715 


Ada61715 


Homo sapi 


45 


1174 


100. 


0 


230 


6 


ADB19500 


Adbl9500 


Novel 


hum 



ALIGNMENTS 



RESULT 1 
AAY36181 



Run on: 



July 6, 2004, 13:55:23 ; Search time 73.2154 Seconds 

(without alignments) 
887.599 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-063-588-80 
1174 

1 MASLGLQLVGYILGLLGLLG 



PGQPPKVKSEFNSYSLTGYV 230 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched : 



1586107 seqs, 282547505 residues 



Total number of hits satisfying chosen parameters 



1586107 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



A_Geneseq_2 9Jan04 : * 

1: geneseqpl980s : * 

2 : geneseqpl990s : * 

3: geneseqp2000s : * 

4: geneseqp2001s : * 

5: geneseqp2002s : * 

6 : geneseqp2003as : * 

7 : geneseqp2003bs : * 

8: geneseqp2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


1174 


100. 


0 


230 


2 


AAY36181 


Aay36181 


Human sec 


2 


1174 


100. 


0 


230 


3 


AAY84609 


Aay84609 


A human m 


3 


1174 


100. 


0 


230 


3 


AAY99378 


Aay99378 


Human PRO 


4 


1174 


100. 


0 


230 


4 


AAB66127 


Aab66127 


Protein o 


5 


1174 


100. 


0 


230 


4 


AAE04207 


Aae04207 


Human gen 


6 


1174 


100. 


0 


230 


4 


AAU12417 


Aaul2417 


Human PRO 


7 


1174 


100. 


0 


230 


4 


AAU09178 


Aau09178 


Human PRO 


8 


1174 


100. 


0 


230 


4 


AAB87565 


Aab87565 


Human PRO 


9 


1174 


100. 


0 


230 


4 


AAB88342 


Aab88342 


Human mem 


10 


1174 


100. 


0 


230 


5 


ABP67991 


Abp67991 


Human col 


11 


1174 


100. 


0 


230 


5 


ABG95890 


Abg95890 


Human sec 


12 


1174 


100. 


0 


230 


5 


ABB84912 


Abb84912 


Human PRO 


13 


1174 


100. 


0 


230 


5 


AAU76534 


Aau76534 


Tumour-as 


14 


1174 


100. 


0 


230 


5 


ABG64507 


Abg64507 


Human alb 


15 


1174 


100. 


0 


230 


5 


ABB04707 


Abb04707 


Human SP8 


16 


1174 


100. 


0 


230 


5 


ABB95518 


Abb95518 


Human ang 



Run on: 



July 6, 2004, 10:17:26 / Search time 6305 Seconds 

(without alignments) 
10139.722 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-063-588-79 
1475 

1 gagagaagtcagcctggcag 



ctgggacatttaaaaaaata 1475 



Scoring table: IDENTITYJtfUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



3470272 seqs, 21671516995 residues 



Total number of hits satisfying chosen parameters 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



6940544 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : GenEmbl : * 



1: 




gb_ba : * 


2. 




gb_htg : * 


3: 




gb__m : * 


4: 




gb_om : * 


5: 




gb ov : * 


6: 




gbjpat : * 


7: 




gb_ph : * 


8: 




gb_j)l : * 


9. 




gb_pr : * 


10 


: gb_ro : * 


11: 


gb_sts : * 


12 


gb_sy:* 


13 


gb_un : * 


14 


gb_vi : * 


15 


em_ba : * 


16 


em_f un : * 


17 


em_hum : * 


18 


em_in : * 


19 


em_mu : * 


20 


em_om : * 


21: 


em_or : * 


22 


em_ov:* 


23 


em_jpat : * 


24 


em jph : * 


25 


em_j>l : * 


26 


; em_ro : * 


27 


em_sts:* 


28 


em_un : * 


29 


em_vi : * 


30 


; em_htg_hum : * 


31 


em_htg_inv : * 


32 


em_htg_other : * 


33 


, em_htg_mus : * 


34 


em_htg_j>ln : * 



J 



3 5 : em_ht g_rod : * 

3 6 : em_htg_mam : * 

37 : em_htg_vrt : * 

38: em_sy:* 

3 9 : em_htgo_hum : * 

40: em_ht go_mus : * 

4 1 : em_htgo_ot her : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 




Query 












No. 


Score 


Match Length DB 


ID 


Description 


1 


1475 


100.0 


1475 


6 


AX092348 


AX092348 


Sequence 


2 


1475 


100.0 


1475 


6 


AX299996 


AX299996 


Sequence 


3 


1475 


100.0 


1475 


6 


AX395213 


AX395213 


Sequence 


4 


1475 


100.0 


1475 


6 


AX454606 


AX454606 


Sequence 


5 


1475 


100.0 


1475 


6 


AX464358 


AX464358 


Sequence 


6 


1475 


100.0 


1475 


6 


AX491084 


AX491084 


Sequence 


7 


1475 


100.0 


1475 


6 


AX697065 


AX697065 


Sequence 


8 


1475 


100.0 


1475 


9 


AY358474 


AY358474 


Homo sapi 


9 


1475 


100. 0 


2959 


9 


AK075405 


AK0754 05 


Homo sapi 


10 


1475 


100. 0 


109465 


9 


AL158821 


AL158821 


Human DNA 


11 


1473.4 


99.9 


1506 


9 


BC014424 


BC014424 


Homo sapi 


12 


1473.4 


99.9 


1918 


9 


AF177340 


AF177340 


Homo sapi 


13 


1472 


99.8 


2742 


6 


BD237562 


BD237562 


Membrane- 


14 
15 


1470.2 


99.7 


2863 


6 


AX136129 


AX136129 


Sequence 



* 



Run on: 



July 6, 2004, 10:17:25 ; Search time 4776.8 Seconds 

(without alignments) 
9220.961 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-063-588-79 
1475 

1 gagagaagtcagcctggcag 



ctgggacatttaaaaaaata 1475 



Scoring table:. I DENTI TY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



27513289 seqs, 14931090276 residues 



Total number of hits satisfying chosen parameters: 



55026578 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database 



EST: * 



1: em_estba:* 

2: em_esthum:* 

3: em_estin:* 

4 : em_estmu: * 

5: em_estov:* 

6 : em_estpl : * 

7: em__estro:* 

8 : em_htc : * 

9: gb_estl:* 
10: gb_est2:* 
11: gb_htc:* 
12: gb_est3:* 
13: gb_est4:* 
14: gb_est5:* 
15: em_estfun:* 
16: em_estom:* 
1 7 : em_gs s_hum : * 
18: em_gss_inv:* 
19: em__gss_jpln: * 
2 0 : em_gs s_vr t : * 
21: em_gss_fun:* 
2 2 : em_gs s_mam : * 
23 : em_gss_mus : * 
2 4 : em_gs s_ pro : * 
2 5 : em_gs s_rod : * 
2 6 : em__gs s_j?hg : * 
27 : em_gss_vrl : * 
28 : gb_gssl : * 
29 : gb_gss2 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 





1 

1 


817.8 


55 


. 4 


2829 


11 


AK004990 


AK004990 


Mus muscu 




2 


TOT 

731 


A O 

49 


. 6 


o o o 

822 


12 


BG386074 


BG386074 


602455248 




3 


693 


47 


. 0 


693 


29 


AY4 00251 


AY4 00251 


Homo sapi 




4 


689.8 


A £ 

46 


. 8 


945 


10 


BE513091 


BE513091 


601171545 




r— 

5 


f ^ f" A 

676 . 4 


45 


. 9 


693 


29 


AY400252 


AY4 00252 


Pan trogl 




6 


643 


43 


.6 


680 


10 


BE304667 


BE304667 


601105783 




7 


637.4 


43 


.2 


728 


12 


BG325755 


BG325755 


602424466 




8 


617 


41 


.8 


978 


12 


BG164062 


BG164062 


602341087 




9 


614.4 


41 


.7 


786 


28 


BZ215522 


BZ215522 


CH230-416 




10 


606 


41 


.1 


643 


12 


BG328625 


BG328625 


602427889 




11 


582.2 


39 


.5 


589 


13 


BU071883 


BU071883 


im35dll.y 


c 


12 


577 


39 


.1 


676 


10 


AW385836 


AW385836 


CM2-LT004 


c 


13 


575.6 


39 


. 0 


630 


10 


AW837727 


AW837727 


CM1-LT004 


c 


14 


555.4 


37 


.7 


623 


10 


AW837724 


AW837724 


CM1-LT004 


c 


15 


555 


37 


.6 


555 


13 


BU073403 


BU073403 


im35dll .x 




16 


552.6 


37 


.5 


111 











r 



Run on: 



July 6, 2004, 10:17:25 ; Search time 702.749 Seconds 

(without alignments) 
8916.548 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 



US-10-063-588-79 
1475 

1 gagagaagtcagcctggcag 



ctgggacatttaaaaaaata 1475 



Scoring table: I DENT I TY_NUC 

Gapop 10.0 , Gapext 1 . 0 



Searched: 



3373863 seqs, 2124099041 residues 



Total number of hits satisfying chosen parameters 



6747726 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



Post -processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : 



N_Geneseq_2 9Jan04 : * 

1: geneseqnl980s : * 

2: geneseqnl990s : * 

3: geneseqn2000s : * 

4 : geneseqn2001as : * 

5 : geneseqn2001bs : * 

6: geneseqn2002s : * 

7 : geneseqn2003as : * 

8 : geneseqn2003bs : * 

9 : geneseqn2003cs : * 

10 : geneseqn2004s : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



% 



Result Query 

No. Score Match Length DB ID Description 



1 


1475 


100. 


0 


1475 


3 


AAA37060 


Aaa37060 


Human PRO 


2 


1475 


100. 


0 


1475 


4 


AAF54296 


Aaf54296 


DNA encod 


3 


1475 


100. 


0 


1475 


4 


AAS2148 9 


Aas21489 


Human cDN 


4 


1475 


100. 


0 


1475 


4 


AAS15360 


Aasl5360 


cDNA enco 


5 


1475 


100. 


0 


1475 


4 


AAF92097 


Aaf 92097 


Human PRO 


6 


1475 


100. 


0 


1475 


6 


ABS74417 


Abs74417 


Human cDN 


7 


1475 


100. 


0 


1475 


6 


ABL88167 


Abl88167 


Human PRO 


8 


1475 


100. 


0 


1475 


6 


ABK11089 


Abkll089 


cDNA enco 


9 


1475 


100. 


0 


1475 


6 


ABL95656 


Abl95656 


Human ang 


10 


1475 


100. 


0 


1475 


7 


ACD24 098 


Acd24098 


Novel hum 


11 


1475 


100. 


0 


1475 


7 


ACA67239 


Aca67239 


cDNA enco 


12 


1475 


100. 


0 


1475 


7 


ACA91203 


Aca91203 


Novel hum 


13 


1475 


100. 


0 


1475 


7 


ACD81580 


Acd81580 


Human cDN 


14 


1475 


100. 


0 


1475 


7 


ACA60402 


Aca60402 


Novel hum 



15 


1475 


100. 


o 


1475 


7 


ACA03848 


16 


1475 


100 . 


0 


1475 


7 


ACA58849 




1 47R 


i no 


o 

V 


1 47R 


7 


nDA 0 7J0O 


18 


1475 


100. 


0 


1475 


7 


ACA64 025 


19 


1475 


100. 


0 


1475 


7 


ACA91289 


20 


1475 


100. 


0 


1475 


7 


ACD45188 


21 


1475 


100. 


0 


1475 


7 


ACD42 04 0 


22 


1475 


100. 


0 


1475 


7 


ACA93736 


23 


1475 


100. 


0 


1475 


7 


ACA67310 



Aca03848 cDNA enco 
Aca58849 cDNA enco 
Abx89386 DNA encod 
Aca64 025 cDNA enco 
Aca91289 cDNA enco 
Acd45188 Human sec 
Acd4204 0 Human sec 
Aca93736 Human cDN 



AF177340 
LOCUS 

DEFINITION 
ACCESSION < 
VERSION 
KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 



TITLE 

JOURNAL 
REFERENCE 
AUTHORS 



TITLE 
JOURNAL 



FEATURES 

source 



CDS 



AF177340 1918 bp mRNA linear PRI 03-OCT-2000 

Homo sapiens clone SP82 claudin 2 mRNA, complete cds. 

AF177340 

AF177340.1 GI:10503979 

* 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 1918) 

Gu,J.R., Wan,D.F., Zhao,X.T., Zhou,X.M., Jiang, H.Q., Zhang, P. P., 
Qin,W.X., Huang, Y., Qiu,X.K., Qian,L.F., He,L.P., Li,H.N., Yu,Y., 
Yu,J. and Han, L.H. 

Novel human cDNA clone with function of inhibiting cancer cell 

growth 

Unpublished 

2 (bases 1 to 1918) 

Gu,J.R., Wan,D.F., Zhao,X.T., Zhou,X.M. , Jiang, H.Q., Zhang, P. P., 
Qin,W.X., Huang,Y., Qiu,X.K., Qian,L.F., He,L.P., Li, H.N. , Yu,Y. , 
Yu,J. and Han, L.H. 
Direct Submission 

Submitted (06-AUG-1999) National Laboratory For Oncogenes & Related 
Genes, Shanghai Cancer Institute, 25/Ln 2200 Xie Tu Road, Shanghai 
200032, P.R. China 

Loca t ion/Qual i f i ers 

1. .1918 

/organism="Homo sapiens" 
/mol_type="mRNA" 
/db_xref ="taxon: 9606" 
/clone="SP82" 
520. .1212 

/note=" similar to Mus musculus claudin 2" 
/ codon_start=l 
/product = 11 claudin 2" 
/protein_id="AAG17984 .1" 
/db_xref = "GI : 10503980" 

/ 1 rans 1 a t i on= " MASLGLQLVGY I LGLLGLLGTLVAMLLPSWKTS S YVGAS I VTAV 
GFS KGLWM ECATHS TG I TQCD I YSTLLGL PAD I QAAQAMMVTS SA I S S LAC 1 1 S WGM 
RCTVF CQE S RAKDRVAVAGGVFF I LGGLLGF I PVAWNLHG I LRDFYS PLVPDSMKFE I 

GEALYLGI ISSLFSLIAGIILCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVK 
SEFNSYSLTGYV" 



ORIGIN 



Query Match 99.9%; Score 1473.4; DB 9; Length 1918; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 1474; Conservative 0; Mismatches 1; Indels 0; Gaps 0; 
Qy 1 GAGAGAAGT CAG CCTGG CAGAGAGACT CTGAAATGAGGGATTAGAGGTGTTCAAGGAG CA 60 

MIIIMMIM IMIMIIII II MINI MM II Ml III II I II Ml II MM MM 

Db 399 GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCA 4 58 



Qy 

Db 

Qy 



61 AGAGCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGC 120 

I Ml II III III MM II I M II I MMM II 1 1 M I M I M 1 1 MM II M M M MM 

459 AGAGCTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTC 518 
121 CATGGCCTCTCTTGGCCTCC^CTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGG 180 



INN II III 1 1 1 1 1 1 1 1 1 > Ml III I Ml II II III Ml III I II II II MM I II I! 

519 CATGGCCTCTCTTGGCCTCCAACTTC 578 
181 CACACTGGTTGCCATGCTGCTCCCCAGC^ 240 

1 1 1 1 1 1 1 1 1 1 1 1 i I i i I M i I i 1 1 1 1 1 [ 1 1 1 1 1 1 1 1 i i i 1 1 1 1 1 1 i i 1 1 1 1 1 i 1 1 M 1 1 1 

579 CACACTGGTTGCCATGCTGCTCCCCAGGTGGAAAAG^ 638 
241 TGTGACAGCAGTTGGCTT^ 300 

IIMII II III II Ml 1 1 M I II 1 1 1 1 1 1 II II II I II I II III I MM II M II I M II 

639 TGTGACAGCAGTTGGCTTCTCCAAGC^ 698 
301 CATCACCCAGTGTGACATCTATAGCACCCTTCTGGGCCTGCCCGCTGA 360 

IIMII II I II I M 1 1 1 1 II I II M MIIIMI II I II III III MM I II M Ml M 1 1 

699 CATCACCCAGTGTGACATCTATAGCACCCTTCTGK^ 758 
361 CCAGGCCATGATGGTGACATCCAGTG 420 

1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ t ! 1 1 1 1 1 1 1 1 1 1 1 1 j 1 1 ) 1 1 1 1 1 1 1 1 1 i 1 1 1 i 1 1 

759 CCAGGCCATGATGGTGACATCCAGTGCAATCTCCTCCCTC 818 
421 GGG CATGAGATGCACAGTCTTCTGCCAGGAA.TCCCGA 480 

I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 i 1 1 1 f ! 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

819 G£K3CATGkGATGCACAGTCTTCTC 878 
481 AGGTGGAGTCTTTTT(^TCCTTGGAGGCCTCCTGGGATTCATTCCTGTTGCCTGGAATCT 540 

1 1 1 1 ! i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ 1 1 1 1 1 1 f 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 

879 AGGTGGAGTCTTTTTCATCCTTGGAGGGCTCCTGGGATTCATTCCTGTTC 938 
541 TCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAGCATGAAATTTGAGAT 600 

1 1 MM II Mill I II II 1 1 1 II M I M M MM II I II II II 1 1 1 II 1 1 1 II M I M M 

939 TCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAGCATGAAATTTGAGAT 998 
601 TGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATCAT 660 

II MM II MM II M II 1 1 1 II Ml M M I II MM MM I II 1 1 1 1 1 1 II I M I M 1 1 

999 TGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATCAT 1058 
661 CCTCTGCTTTTCCTGCTCATCCC^GAGAAATCXSCTCC^ACTACTACGATGCCTACCAAGC 720 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

1059 CCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTACCAAGC 1118 
721 CCAACCTCTTGCCACyUVGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 780 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

1119 CCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 1178 
781 GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCA 840 

MM II III I II I II Ml II I II II 1 1 II II MM M I M 1 1 1 1 1 1 II II 1 1 1 1 II II II 

1179 GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGT 1238 
841 GGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGQ 900 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

1239 GGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGC 1298 
901 ACTGGATCGTGTCAGAAGGTGCTCCTGAGGAT^ 960 

MMMMMMMMMMMMMMMMMMMMMMMMMMMMMM 

1299 ACTGGATCGTGTCAGAAGGTG CTGCTCAGGATAGACTGACTTTGGCCATTGGATTC CA 1358 
961 AAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCT 1020 

MMMMMMMMMMMMMMMMMMMI 



Db 


1359 


Qy 


1021 


Db 


1419 


Qy 


1081 


Db 


1479 


Qy 


1141 


Db 


1539 


Qy 


1201 


Db 


1599 


Qy 


1261 


Db 


1659 


Qy 


1321 


Db 


1719 


Ov 


lOO 1 


Db 


1779 


Qy 


1441 


Db 


1839 



1359 AAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAA 1418 



IIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIMIIIIIIIII 



II .11 III ill II II II IUM Ml II Ml 



Ml III III IMIIMI I III II 1 1 1 1! IIMIIII Mill lllllll II II III MM 



IIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 



MIIIIIIIIIIIIIIIIIIHIIIIIIilllllMIIMIIIIMIII! 



II III II III Mill 1 1 II I II III INI I III Mil III Mill III 

CCTGGAACCTCCATCCC^CTCTTCT 1778 
1381 (^CTGAAATAAAACC^TCCT^^ 1440 

Ml III Mill Mill lllllll III II III IMIIIIIMIIMIIIII llllll II I 

1779 GAACTGAAATAAAACCATCCTACGGTATCCAGGGAACA.GAA^ 1838 

1441 GGA CAGGAAGG CAG C CTGGGA CATTTAAAAAAATA 1475 

IIIIIIIIMIIIIIIIIIIIIIIIIIIIIIII I 
1839 GGACAGGAAGG CAG CCTGGGA CATTTAAAAAAAAA 1873 



ID CLD2_HUMAN STANDARD; PRT; 230 AA. 

AC P57739; 

DT 16-OCT-2001 (Rel . 40, Created) 

DT 16-OCT-2001 (Rel. 40, Last sequence update) 

DT 15-MAR-2004 (Rel. 43, Last annotation update) 

DE Claudin-2 (UNQ705/PRO1356) . 

GN CLDN2 . 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N . A. 

RC TISSUE=Epithelium; 

RA Reinecker H.-C, Sakaguchi T., Golden H.M.; 

RL Submitted (MAR-2000) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP SEQUENCE FROM N.A. 

RA Gu J.R., Wan D.F., Zhao X.T., Zhou X.M., Jiang H.Q., Zhang P.P., 

RA Qin W.X., Huang Y., Qiu X.K., Qian L.F., He L.P., Li H.N., Yu Y. , 

RA Yu J. , Han L.H. ; 

RT "Novel human cDNA clone with function of inhibiting cancer cell 

RT growth . " ; 

RL Submitted (AUG-1999) to the EMBL/GenBank/DDBJ databases. 

RN [3] 

RP SEQUENCE FROM N.A. 

RX MEDLINE=22887296; PubMed=12975309 ; 

RA Clark H.F., Gurney A.L., Abaya E., Baker K. et al. 

RT "The secreted protein discovery initiative (SPDI), a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinformatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

RN [4] 

RP SEQUENCE FROM N.A. 

RA Heath P. ; 

RL Submitted (FEB-2002) to the EMBL/GenBank/DDBJ databases. 

RN [5] 

RP SEQUENCE FROM N.A. 

RC TISSUE=Colon, and Kidney; 

RX MEDLINE=22388257; PubMed=12477932 ; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L. , Shenmen CM. , Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F. # Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L. , 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C. , 

RA Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K. J. , Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M. , Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J. , Helton E. , Ketteman M. , Madan A. , Rodrigues S. , Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U. , Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A.; 



RT "Generation and initial analysis of more than 15,000 full-length 

RT human and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC -!- FUNCTION: Component of tight junction (TJ) strands. 

CC -!- SUBCELLULAR LOCATION: Integral membrane protein. 

CC -!- SIMILARITY: Belongs to the claudin family. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinformatics and the EMBL outstation - 

CC the European Bioinformatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 

CC 

DR EMBL; AF250558; AAF98151.1; 

DR EMBL; AF17734 0; AAG17984.1; 

DR EMBL; AY358474; AAQ88838.1; -. 

DR EMBL; AL158821; CAD23055.1; -. 

DR EMBL; BC015252; AAH15252.1; -. 

DR EMBL; BC014424; AAH14424.1; 

DR Genew; HGNC:2041; CLDN2 . 

DR InterPro; I PRO 06 187; Claudin. 

DR InterPro; IPR005411; Claudin2 . 

DR InterPro; IPR006188; Claudin_reg. 

DR InterPro; IPR004031; PMP22_Claudin. 

DR Pfam; PF00822; PMP22_Claudin ; 1. 

DR PRINTS; PR01077; CLAUDIN. 

DR PRINTS; PRO 158 9; CLAUD I N2 . 

DR PROSITE; PS01346; CLAUDIN; 1. 

KW Tight junction; Transmembrane. 

FT TRANSMEM 8 28 POTENTIAL. 

FT TRANSMEM 82 102 POTENTIAL. 

FT TRANSMEM 117 137 POTENTIAL. 

FT TRANSMEM 163 183 POTENTIAL. 

SQ SEQUENCE 230 AA; 24548 MW; 52CA642D4A62B70D CRC64 ; 

Query Match 100.0%; Score 1174; DB 1; Length 230; 

Best Local Similarity 100.0%; Pred. No. 3.7e-92; 

Matches 230; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
Qy 1 MASLGLQLVGYI LGLLGLLGTLVAMLLPSWKTSS YVGAS I VTAVGFSKGLWMECATHSTG 60 

I II MM I MM Ml 1 1 III II 1 1 III II II III II II I II III I II II I It MM III I 

Db 1 MASLGLQLVGYI LGLLGLLGTLVAMLLPSWKTSS YVGAS I VTAVGFSKGLWMECATHSTG 60 

Qy 61 ITQCDI YSTLLGLPADI QAAQAMMVTSSAI SSLACI I S WGMRCTVFCQESRAKDRVAVA 120 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 

Db 61 ITQCDI YSTLLGLPADI QAAQAMMVTSSAI SSLACI I SWGMRCTVFCQESRAKDRVAVA 120 

Qy 121 GGVFFILGGLLGFI PVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGI I SSLFSLIAGI I 180 

I M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 II I II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 

Db 121 GGVFFILGGLLGFI PVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGI I 180 

Qy 181 LCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 230 

1 1 1 1 1 1 1 1 i 1 1 1 1 1 i 1 1 ( 1 1 1 1 1 i 1 1 1 1 i 1 1 ii [ 1 1 1 1 r 1 1 1 1 1 1 1 i ii i 

Db 181 LCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 230 




mm 



Pubftfed 



Nucleotide- 



Protein 



Genome 



Structure 



PMC 




Search I Protein 




Limits 



Preview/fndex 



History 



Clipboard 



Taxonomy 



•.v.v: 



Details 



WmMm jGenPept I§ ^Hj all to file I 

^- ^'^•Y^ 'n^'ftt^ya Vri'fi Vn'iff # ............................ VnnninVrv^VitV«.Vii^ > •- — • 1 

Range: from [begin J to jend J Features: DSNP PCDD Iv; MGC P 

r 

01: P57739. Reports Claudin-2 (UNQ705...[gi: 12229749] 



II PRO 



OMiM 



Bo*te 



bib 



8L!nk ; Domains, 
Links 



LOCUS 

DEFINITION 
ACCESSION 
VERSION 
DBSOURCE 



KEYWORDS 
SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 



TITLE 
JOURNAL 
REMARK 
REFERENCE 
AUTHORS 



P57739 230 aa linear PRI 01-MAY-2005 

Claudin-2 (UNQ705/PRO1356) (SP82) . 

P57739 

P57739 GI:12229749 

swissprot: locus CLD2_HUMAN, accession P57739 ; 
class: standard, 
created: Oct 16, 2001. 
sequence updated: Oct 16, 2001. 
annotation updated: May 1, 2005. 

xrefs: AF250558.1 , AAF98151.1 , AF177340.1 , AAG17984.1 , AY358474. 1 , 
AAQ88838.1 , AK075371. 1 , BAC11575.1 , AL158821. 16 , CAD23055. 1 , 
BC015252.1 , AAH15252.1 , BC014424.1 , AAH14424.1 , BC071747.1 , 
AAH71747. 1 

xrefs (non-sequence databases) : EnsemblENSG00000165376, 
GenewHGNC:2041, H-InvDBHIX0016971, MIM 300520 , GO0016021, 
GO0005923, GO0042802, GO0016338, InterProIPR006187 , 
InterProIPR005411, InterProIPR006188 , InterProIPR004031, 
PfamPF00822, PRINTS PR01 077 , PRINTSPR01589 , PROSITEPS01346 
Tight junction; Transmembrane. 
Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae ; Homo . 

1 (residues 1 to 230) 

Sakaguchi,T. , Gu,X., Golden, H.M., Suh,E., Rhoads,D.B. and 
Reinecker , H . C . 

Cloning of the human claudin-2 5' -flanking region revealed a 
TATA- less promoter with conserved binding sites in mouse and human 
for caudal-related homeodomain proteins and hepatocyte nuclear 
f actor-lalpha 

J. Biol. Chem. 277 (24), 21361-21370 (2002) 
11934881 

NUCLEOTIDE SEQUENCE. 
TISSUE=Colon epithelium 

2 (residues 1 to 230) 

Gu,J.R., Wan,D.F., Zhao,X.T., Zhou,X.M., Jiang, H.Q., Zhang, P. P., 
Qin,W.X., Huang, Y., Qiu,X.K., Qian,L.F., He,L.P., Li, H.N. , Yu,Y., 
Yu,J. and Han, L. H . 
Direct Submission 
Submitted ( ??-AUG-1999) 
NUCLEOTIDE SEQUENCE. 

3 (residues 1 to 230) 

Clark, H.F., Gurney,A.L., Abaya,E., Baker, K., Baldwin, D., Brush, J., 
Chen, J., Chow,B., Chui,C, Crowley, C, Currell,B., Deuel, B., 
Dowd,P., Eaton, D., Foster, J., Grimaldi,C, Gu,Q., Hass,P.E., 
Heldens,S., Huang, A., Kim,H.S., Klimowski, L . , Jin,Y., Johnson, S., 
Lee, J., Lewis, L., Liao,D., Mark,M., Robbie, E., Sanchez, C, 
Schoenf eld, J. , Seshagiri, S . , Simmons, L., Singh, J., Smith, V., 
Stinson,J., Vagts,A., Vandlen,R., Watanabe,C, Wieand,D., Woods, K., 
Xie,M.H., Yansura,D., Yi,S., Yu,G., Yuan, J., Zhang, M. , Zhang, Z., 
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TITLE 

JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 



Goddard,A., Wood, W. I., Godowski,P. and Gray, A. 

The secreted protein discovery initiative (SPDI) , a large-scale 

effort to identify novel human secreted and transmembrane proteins: 

a bioinf ormatics assessment 

Genome Res. 13 (10), 2265-2270 (2003) 

12975309 

NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 

4 (residues 1 to 230) 

Ota,T., Suzuki, Y., Nishikawa, T . , Otsuki,T., Sugiyama,T., Irie,R., 
Wakamatsu, A. , Hayashi,K., Sato,H., Nagai,K., Kimura,K., Makita,H., 
Sekine,M., Obayashi,M., Nishi,T., Shibahara, T . , Tanaka,T., 
Ishii,S., Yamamoto,J., Saito,K., Kawai,Y., Isono,Y., Nakamura,Y., 
Nagahari,K., Murakami, K., Yasuda,T., Iwayanagi, T . , Wagatsuma, M. , 
Shiratori, A. , Sudo,H., Hosoiri,T., Kaku,Y., Kodaira,H., Kondo,H., 
Sugawara,M., Takahashi, M. , Kanda,K., Yokoi,T., Furuya,T., 
Kikkawa,E., Omura,Y., Abe,K., Kamihara,K., Katsuta,N., Sato,K., 
Tanikawa,M., Yamazaki,M., Ninomiya,K., Ishibashi, T . , Yamashita, H . , 
Murakawa,K., Fujimori, K., Tanai,H., Kimata,M., Watanabe,M., 
Hiraoka,S., Chiba,Y., Ishida,S., Ono,Y., Takiguchi , S . , Watanabe,S., 
Yosida,M., Hotuta,T., Kusano,J., Kanehori,K., Takahashi-Fuj ii, A. , 
Hara,H., Tanase,T.O., Nomura, Y., Togiya,S., Komai,F., Hara,R., 
Takeuchi,K., Arita,M., Imose,N., Musashino, K. , Yuuki,H., Oshima,A., 
Sasaki, N., Aotsuka,S., Yoshikawa, Y . , Matsunawa, H. , Ichihara,T., 
Shiohata,N., Sano,S., Moriya,S., Momiyama,H., Satoh,N., Takami,S., 
Terashima, Y . , Suzuki, O. , Nakagawa,S., Senoh,A., Mizoguchi, H. , 
Goto,Y., Shimizu,F., Wakebe,H., Hishigaki, H . , Watanabe,T., 
Sugiyama,A., Takemoto,M., Kawakami,B., Yamazaki,M. , Watanabe,K., 
Kumagai,A., Itakura,S., Fukuzumi,Y., Fujimori, Y., Komiyama,M., 
Tashiro,H., Tanigami,A., Fujiwara,T., Ono,T., Yamada,K., Fujii,Y., 
Ozaki,K., Hirao,M., Ohmori,Y., Kawabata,A., Hikiji,T., Kobatake,N., 
Inagaki,H., Ikema,Y., Okamoto,S., Okitani,R., Kawakami,T., 
Noguchi,S., Itoh,T., Shigeta,K., Senba,T., Matsumura, K. , 
Nakajima,Y., Mizuno,T., Morinaga,M., Sasaki, M., Togashi,T., 
Oyama,M., Hata,H., Watanabe,M., Komatsu,T., Mizushima-Sugano, J. , 
Satoh,T., Shirai,Y., Takahashi, Y . , Nakagawa,K., Okumura,K., 
Nagase,T., Nomura, N., Kikuchi,H., Masuho,Y., Yamashita, R . , 
Nakai,K., Yada,T., Nakamura,Y., Ohara,0., Isogai,T. and Sugano,S. 
Complete sequencing and characterization of 21,243 full-length 
human cDNAs 

Nat. Genet. 36 (1), 40-45 (2004) 
14702039 

NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 
TISSUE=Teratocarcinoma 

5 (residues 1 to 230) 

Ross,M.T., Grafham,D.V. , Coffey, A. J., Scherer,S., McLay, K. , 
Muzny, D., Platzer,M., Howell, G.R., Burrows, C, Bird, CP., 
Frankish,A., Lovell,F.L., Howe,K.L., Ashurst, J. L. , Fulton, R.S., 
Sudbrak,R., Wen,G., Jones, M.C., Hurles,M.E., Andrews , T . D . , 
Scott, C.E., Searle,S., Ramser,J., Whittaker, A. , Deadman,R., 
Carter, N. P., Hunt,S.E., Chen,R., Cree,A., Gunaratne, P . , Havlak,P., 
Hodgson, A., Metzker,M. L. , Richards, S., Scott, G., Steffen,D., 
Sodergren,E. , Wheeler, D. A. , Worley,K.C. , Ainscough,R. , 
Ambrose, K.D. , Ansari-Lari, M. A. , Aradhya, S . , Ashwell, R. I . , 
Babbage, A.K. , Bagguley, C. L. , Ballabio,A., Banerjee,R., Barker, G.E., 
Barlow,K.F., Barrett, I . P . , Bates, K.N. , Beare,D.M., Beasley,H., 
Beasley,0., Beck, A., Bethel, G. , Blechschmidt, K. , Brady, N., 
Bray-Allen, S . , Bridgeman, A. M. , Brown, A. J., Brown, M. J. , Bonnin,D., 
Bruf ord, E . A. , Buhay,C, Burch, P., Burford,D., Burgess, J., 
Burrill,W., Burton, J., Bye,J.M., Carder, C, Carrel, L., Chako,J., 
Chapman, J. C. , Chavez, D., Chen,E., Chen,G., Chen,Y., Chen,Z., 
Chinault,C, Ciccodicola, A. , Clark, S.Y., Clarke, G. , Clee,C.M., 
Clegg,S., Clerc-Blankenburg, K. , Clifford, K., Cobley,V., Cole,C.G., 
Conquer, J. S . , Corby, N., Connor, R.E., David, R., Davies,J., Davis, C, 
Davis, J., Delgado,0., Deshazo,D., Dhami,P., Ding,Y., Dinh,H., 
Dodsworth, S. , Draper, H., Dugan-Rocha, S . , Dunham, A. , Dunn,M., 
Durbin,K.J., Dutta,!., Eades,T., Ellwood,M., Emery-Cohen, A. , 
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CONSRTM 
TITLE 

JOURNAL 
PUBMED 
REMARK 

COMMENT 



Errington, H. , Evans, K.L., Faulkner, L., Francis, F., Frankland, J. , 
Fraser,A.E., Galgoczy,P., Gilbert, J., Gill,R., Glockner,G., 
Gregory, S . G. , Gribble,S., Griff iths, C. , Grocock,R., Gu,Y., 
Gwilliam, R., Hamilton, C. , Hart, E. A., Hawes,A., Heath, P. D., 
Heitmann, K. , Hennig,S., Hernandez, J. , Hinzmann, B. , Ho,S., Hoffs,M., 
Howden,P.J., Huckle,E.J., Hume, J., Hunt, P.J. , Hunt, A. R., 
Isherwood, J. , Jacob, L., Johnson, D., Jones, S., de Jong, P.J. , 
Joseph, S.S., Keenan,S., Kelly, S., Kershaw, J. K. , Khan,Z., 
Kioschis,P., Klages,S., Knights, A. J. , Kosiura,A., Kovar-Smith, C . , 
Laird, G.K., Langford,C, Lawlor,S., Leversha,M., Lewis, L., Liu,W., 
Lloyd, C, Lloyd, D.M., Loulseged, H . , Loveland, J. E . , Lovell,J.D., 
Lozado,R., Lu,J., Lyne,R., Ma, J., Maheshwari, M. , Matthews, L.H. , 
McDowall,J., McLaren, S., McMurray,A. , Meidl,P., Meitinger, T . , 
Milne, S., Miner, G. , Mistry,S.L., Morgan, M. , Morris, S., Muller, I : , 
Mullikin, J.C. , Nguyen, N. , Nordsiek,G., Nyakatura, G. , O f Dell,C.N., 
Okwuonu,G., Palmer, S., Pandian,R., Parker, D., Parrish,J., 
Pasternak, S . , Patel,D., Pearce,A.V., Pearson, D . M. , Pelan,S.E., 
Perez, L., Porter, K.M., Ramsey, Y., Reichwald, K. , Rhodes, S., 
Ridler, K.A. , Schlessinger, D . , Schueler, M. G. , Sehra, H.K. , 
Shaw-Smith, C. , Shen,H., Sheridan, E .M. , Shownkeen, R. , Skuce,C.D., 
Smith, M. L. , Sotheran, E . C. , Steingruber, H. E . , Steward, C. A. , 
Storey, R., Swann,R.M., Swarbreck, D. , Tabor, P. E., Taudien,S., 
Taylor, T., Teague,B., Thomas, K., Thorpe, A., Timms,K., Tracey,A. , 
Trevanion, S . , Tromans, A. C. , d'Urso,M., Verduzco,D., Villasana, D . , 
Waldron,L., Wall,M., Wang,Q., Warren, J., Warry,G.L., Wei,X., 
West, A. , Whitehead, S . L. , Whiteley, M. N. , Wilkinson, J . E . , 
Willey, D. L. , Williams, G. , Williams, L. , Williamson, A. , 
Williamson, H. , Wilming,L., Woodmansey, R . L. , Wray, P.W., Yen, J., 
Zhang, J., Zhou, J., Zoghbi,H., Zorilla,S., Buck,D., Reinhardt, R. , 
Poustka,A., Rosenthal, A. , Lehrach,H., Meindl,A., Minx, P. J. , 
Hillier, L.W. , Willard, H. F. , Wilson, R.K., Waterston, R. H. , Rice, CM., 
Vaudin,M., Coulson,A., Nelson,D.L., Weinstock, G. , Sulston, J. E . , 
Durbin,R., Hubbard, T., Gibbs,R.A. , Beck,S., Rogers, J. and 
Bentley,D.R. 

The DNA sequence of the human X chromosome 
Nature 434 (7031), 325-337 (2005) 
15772651 

NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA]. 
6 (residues 1 to 230) 

Strausberg, R. L. , Feingold, E .A. , Grouse, L.H. , Derge, J.G. , 
Klausner, R.D. , Collins, F . S . , Wagner, L., Shenmen, C .M. , Schuler, G. D . , 
Altschul, S . F. , Zeeberg,B., Buetow,K.H., Schaefer, C. F. , Bhat,N.K., 
Hopkins, R. F. , Jordan, H., Moore, T., Max,S.I., Wang, J., Hsieh,F., 
Diatchenko, L. , Marusina,K., Farmer, A. A., Rubin, G.M., Hong,L., 
Stapleton,M. , Soares,M.B. , Bonaldo,M.F. , Casavant, T . L . , 
Scheetz, T. E . , Browns tein, M. J. , Usdin, T . B. , Toshiyuki, S . , 
Carninci,P., Prange,C, Raha,S.S., Loquellano, N. A. , Peters, G. J., 
Abramson, R.D. , Mullahy, S . J. , Bosak, S .A. , McEwan, P. J. , 
McKernan, K. J. , Malek, J. A. , Gunaratne, P . H . , Richards, S. , 
Worley,K.C, Hale,S., Garcia, A.M., Gay,L.J., Hulyk,S.W., 
Villalon, D.K. , Muzny, D .M. , Sodergren, E . J. , Lu,X., Gibbs,R.A., 
Fahey,J., Helton, E., Ketteman,M., Madan,A., Rodrigues, S . , 
Sanchez, A., Whiting, M., Madan,A., Young, A. C. , Shevchenko, Y . , 
Bouf f ard, G. G. , Blakesley, R . W. , Touchman, J. W. , Green, E.D. , 
Dickson, M. C. , Rodriguez, A. C. , Grimwood,J., Schmutz,J., Myers, R.M., 
Butterfield, Y.S. , Krzywinski, M. I . , Skalska,U. , Smailus,D.E. , 
Schnerch,A., Schein,J.E., Jones, S.J. and Marra,M.A. 
Mammalian Gene Collection Program Team 

Generation and initial analysis of more than 15,000 full-length 
human and mouse cDNA sequences 

Proc. Natl. Acad. Sci. U.S.A. 99 (26), 16899-16903 (2002) 
12477932 

NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] . 
TISSUE=Colon, Kidney, and Skin 

[FUNCTION] Plays a major role in tight junction-specific 
obliteration of the intercellular space, through 
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Protein 
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calcium- independent cell-adhesion activity (By similarity) . 
[SUBUNIT] Can form homo- and heteropolymers with other CLDN. 
Homopolymers interact with CLDN3, but not CLDN1, homopolymers . 
Directly interacts with TJP1/Z0-1, TJP2/ZO-2 and TJP3/ZO-3 (By 
similarity) . 

[ SUBCELLULAR LOCATION] Integral membrane protein. Tight junctions 
(By similarity) . 

[SIMILARITY] Belongs to the claudin family. 
Location/Qualifiers 
1. .230 

/organism="Homo sapiens" 
/db_xref="taxon: 9606 " 
1. .230 

/gene="CLDN2 " 
1. .230 

/gene="CLDN2 " 
/product= !l Claudin-2" 
1. .7 

/gene="CLDN2 " 
/region_name= "Domain" 
/note="Cytoplasmic (Potential) . " 
/evidence=not_experimental 
8 . .28 

/gene="CLDN2" 

/region_name= "Transmembrane region" 
/note="Potential . " 
/evidence=not_experimental 
29. . 81 

/gene="CLDN2 " 
/region_name= "Domain" 
/note="Extracellular (Potential) ." 
/evidence=not_experimental 
82. .102 
/gene="CLDN2" 

/region_name= "Transmembrane region" 
/note="Potential . " 
/evidence=not_experimental 
103. .116 
/gene="CLDN2 " 
/region_name="Domain" 
/note="Cytoplasmic (Potential) ." 
/evidence=not_experimental • 
117. .137 
/gene="CLDN2 " 

/region_name=" Transmembrane region" 
/note="Potential . " 
/evidence=not_experimental 
138. .162 
/gene="CLDN2 " 
/ region_name= "Domain " 
/note="Extracellular (Potential) . " 
/evidence=not_experimental 
163. .183 
/gene="CLDN2 " 

/region_name= "Transmembrane region" 
/note="Potential . " 
/evidence=not_experimental 
184. .230 
/gene="CLDN2 " 
/region_name= "Domain" 
/note="Cytoplasmic (Potential) . " 
/evidence=not_experimental 
229. .230 
/gene="CLDN2" 
/site_type="unclassif ied" 

/note="Interactions with TJP1, TJP2 and TJP3 (By 



similarity) . " 

/evidence=not_experimental 

ORIGIN 

1 maslglqlvg yilgllgllg tlvamllpsw ktssyvgasi vtavgfskgl wmecathstg 
61 itqcdiystl lglpadiqaa qammvtssai sslaciisw gmrctvfcqe srakdrvava 
121 ggvffilggl lgfipvawnl hgilrdfysp lvpdsmkfei gealylgiis slfsliagii 
181 lcfscssqrn rsnyydayqa qplatrsspr pgqppkvkse fnsysltgyv 
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begin j to [end I O Reverse complemented strand 

F; MGC □ HPRD □ STS 

□ l: NMJ)20384. Reports Homo sapiens clau...[gi: 3 845 5423] 



Features: □ SNP □ CDD 
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LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



linear PRI 23-APR-2005 



REFERENCE 
AUTHORS 

TITLE 



JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 



JOURNAL 
PUBMED 
REMARK 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 

TITLE 



JOURNAL 



NM_020384 2959 bp mRNA 

Homo sapiens claudin 2 { CLDN2 ) , mRNA. 
NM_020384 

NM 020384.2 GI:38455423 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Euarchontoglires; Primates; Catarrhini; 
Hominidae; Homo. 

1 (bases 1 to 2959) 

Mankertz,J., Hillenbrand, B . , Tavalali,S., Huber,0., Fromm,M. and 
Schulzke, J.D. 

Functional crosstalk between Wnt signaling and Cdx-related 
transcriptional activation in the regulation of the claudin-2 
promoter activity 

Biochem. Biophys. Res. Commun. 314 (4), 1001-1007 (2004) 
14751232 

GeneRIF: role for the paracellular barrier function by opening 
pores for small cations 

2 (bases 1 to 2959) 

Colegio,O.R. , Van Itallie,C, Rahner,C. and Anderson, J. M. 
Claudin extracellular domains determine paracellular charge 
selectivity and resistance but not tight junction fibril 
architecture 

Am. J. Physiol., Cell Physiol. 284 (6), C1346-C1354 (2003) 
12700140 

GeneRIF: Results support a model in which claudins 2 and 4 create 
paracellular channels and the first extracellular domain is 
sufficient to determine both paracellular charge selectivity and 
transepithelial electrical resistance. 

3 (bases 1 to 2959) 

Gonzalez-Mariscal, L. , Betanzos,A., Nava,P. and Jaramillo, B . E . 
Tight junction proteins 

Prog. Biophys. Mol . Biol. 81 (1), 1-44 (2003) 

12475568 

Review article 

4 (bases 1 to 2959) 
Tsukita,S. and Furuse,M, 

Claudin-based barrier in simple and stratified cellular sheets 
Curr. Opin. Cell Biol. 14 (5), 531-536 (2002) 
12231346 
Review article 

5 (bases 1 to 2959) 

Sakaguchi,T. , Gu,X., Golden, H.M., Suh,E., Rhoads,D.B. and 
Reinecker, H. C. 

Cloning of the human claudin-2 5 1 -flanking region revealed a 
TATA- less promoter with conserved binding sites in mouse and human 
for caudal-related homeodomain proteins and hepatocyte nuclear 
f actor-lalpha 

J. Biol. Chem. 277 (24), 21361-21370 (2002) 



PUBMED 
REMARK 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 
PUBMED 
REMARK 

REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
COMMENT 



FEATURES 

source 



gene 



CDS 



11934881 

GeneRIF: Cloning of the human claudin-2 5' -flanking region revealed 
a TATA- less promoter with conserved binding sites in mouse and 
human for caudal- related homeodomain proteins and hepatocyte 
nuclear f actor-lalpha . 

6 (bases 1 to 2959) 
Tsukita,S., Furuse,M. and Itoh,M. 
Multifunctional strands in tight junctions 
Nat. Rev. Mol. Cell Biol. 2 (4), 285-293 (2001) 
11283726 

Review article 

7 (bases 1 to 2959) 

* 

Heiskala,M., Peterson, P. A. and Yang,Y. 

The roles of claudin superfamily proteins in paracellular transport 
Traffic 2 (2), 93-98 (2001) 
11247307 
Review article 

8 (bases 1 to 2959) 
Kniesel,U. and Wolburg,H. 

Tight junctions of the blood-brain barrier 
Cell. Mol. Neurobiol. 20 (1), 57-76 (2000) 
10690502 
Review article 

9 (bases 1 to 2959) 

Itoh,M., Furuse,M., Morita,K., Kubota,K., Saitou,M. and Tsukita,S. 
Direct binding of three tight junction-associated MAGUKs, ZO-1, 
ZO-2, and ZO-3, with the COOH termini of claudins 
J. Cell Biol. 147 (6), 1351-1363 (1999) 
10601346 

10 (bases 1 to 2959) 

Morita,K., Furuse,M., Fujimoto,K. and Tsukita,S. 

Claudin multigene family encoding f our-transmembrane domain protein 
components of tight junction strands 

Proc. Natl. Acad. Sci. U.S.A. 96 (2), 511-516 (1999) 
9892664 

11 (bases 1 to 2959) 

Furuse,M., Fujita,K., Hiiragi,T., Fujimoto,K. and Tsukita,S. 
Claudin-1 and -2: novel integral membrane proteins localizing at 
tight junctions with no sequence similarity to occludin 
J. Cell Biol. 141 (7), 1539-1550 (1998) 
9647647 

VALIDATED REFSEQ : This record has undergone preliminary review of 
the sequence, but has not yet been subject to final review. The 
reference sequence was derived from AK075405 . 1 . 
On Nov 20, 2003 this sequence version replaced qi : 9966780 . 

Summary: Members of the claudin protein family, such as CLDN2 , are 
expressed in an organ-specific manner and regulate the 
tissue-specific physiologic properties of tight junctions 
(Sakaguchi et al., 2002) . [supplied by OMIM] . 

Location/Qualifiers 

1. .2959 

/organism="Homo sapiens" 

/mol_type="mRNA" 

/db_xref =" taxon : 9606 " 

/chromosome="X" 

/map= ,, Xq22.3-23 ,, 

1. .2959 

/gene="CLDN2" 

/db_xref="GeneID: 9075 " 

/db_xref="HPRD:HPRD_06471" 

/db_xre f ="MIM : 300520 " 

327. .1019 

/gene="CLDN2" 

/note="go_component : membrane [goid 0016020 ] [evidence 
IEA] ; 



go_component : tight junction [goid 0005923 ] [evidence 
ISS ] ; 

go_component : integral to membrane [goid 0016021 ] 
[evidence ISS]; 

go_f unction: protein self binding [goid 0042802 ] [evidence 
ISS]; 

go_f unction: structural molecule activity [goid 0005198 ] 
[evidence IEA] ; 

go_process: calcium-independent cell-cell adhesion [goid 

0016338 ] [evidence ISS]" 

/codon_start=l 

/product="claudin 2" 

/protein id=" NP 065117.1 " 

/db_xref="GI: 9966781" 

/ db_x re f = " CCDS : CCDS14524 . 1 " 

/db xref="GeneID: 9075 " 

/db_xref="HPRD:HPRD_06471" 

/db xref="MIM: 300520 " 

/translation="MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAV 
GF S KGLWME CATHST GI TQCDIYSTLLGL PAD I QAAQAMMVT S S AI S S LAC 1 1 SWGM 
RCTVFCQESRAKDRVAVAGGVFFI LGGLLGFI PVAWNLHGI LRDFYS PLVPDSMKFEI 
GEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVK 
SEFNSYSLTGYV" 



ORIGIN 



1 agatggattt tgcaaagctg tggttaacga ttagaaatcc tttatcacct cagcccgtgg 
61 ccccttgtac ttcgctcccc tccctcagga tccctttctc cctctccagg ggcatctccc 
121 cctccaaggc tctgcaaaga actgccctgt cttctagatg ccttcttgag gctgcttgtg 
181 gccacccaca gacacttgta aggaggagag aagtcagcct ggcagagaga ctctgaaatg 
241 agggattaga ggtgttcaag gagcaagagc ttcagcctga agacaaggga gcagtccctg 
301 aagacgcttc tactgagagg tctgccatgg cctctcttgg cctccaactt gtgggctaca 
361 tcctaggcct tctggggctt ttgggcacac tggttgccat gctgctcccc agctggaaaa 
421 caagttctta tgtcggtgcc agcattgtga cagcagttgg cttctccaag ggcctctgga 
481 tggaatgtgc cacacacagc acaggcatca cccagtgtga catctatagc acccttctgg 
541 gcctgcccgc tgacatccag gctgcccagg ccatgatggt gacatccagt gcaatctcct 
601 ccctggcctg cattatctct gtggtgggca tgagatgcac agtcttctgc caggaatccc 
661 gagccaaaga cagagtggcg gtagcaggtg gagtcttttt catccttgga ggcctcctgg 
721 gattcattcc tgttgcctgg aatcttcatg ggatcctacg ggacttctac tcaccactgg 
781 tgcctgacag catgaaattt gagattggag aggctcttta cttgggcatt atttcttccc 
841 tgttctccct gatagctgga atcatcctct gcttttcctg ctcatcccag agaaatcgct 
901 ccaactacta cgatgcctac caagcccaac ctcttgccac aaggagctct ccaaggcctg 
961 gtcaacctcc caaagtcaag agtgagttca attcctacag cctgacaggg tatgtgtgaa 
1021 gaaccagggg ccagagctgg ggggtggctg ggtctgtgaa aaacagtgga cagcaccccg 
1081 agggccacag gtgagggaca ctaccactgg atcgtgtcag aaggtgctgc tgaggataga 
1141 ctgactttgg ccattggatt gagcaaaggc agaaatgggg gctagtgtaa cagcatgcag 
1201 gttgaattgc caaggatgct cgccatgcca gcctttctgt tttcctcacc ttgctgctcc 
1261 cctgccctaa gtccccaacc ctcaacttga aaccccattc ccttaagcca ggactcagag 
1321 gatccctttg ccctctggtt tacctgggac tccatcccca aacccactaa tcacatccca 
1381 ctgactgacc ctctgtgatc aaagaccctc tctctggctg aggttggctc ttagctcatt 
1441 gctggggatg ggaaggagaa gcagtggctt ttgtgggcat tgctctaacc tacttctcaa 
1501 gcttccctcc aaagaaactg attggccctg gaacctccat cccactcttg ttatgactcc 
1561 acagtgtcca gactaatttg tgcatgaact gaaataaaac catcctacgg tatccaggga 
1621 acagaaagca ggatgcagga tgggaggaca ggaaggcagc ctgggacatt taaaaaaata 
1681 aaaatgaaaa aaaaacccag aacccatttc tcagggcact ttccagaatt ctctcatatt 
1741 tgtgggctgg gatcaagcct gcagcttgag gaaagcacaa ggaaaggaaa gaagatctgg 
1801 tggaaagctc aggtggcagc ggactctgac tccactgagg aactgcctca gaagctgcga 
1861 tcacaacttt ggctgaagcc cctgcctcac tctagggcac ctgacctggc ctcttgccta 
1921 aaccacaagg ctaagggcta tagacaatgg tttccttagg aacagtaaac cagtttttct 
1981 agggatggcc cttggctggg ggatgacagt gtgggagctg tggggtactg aggaagacac 
2041 cattccttga cggtgtctaa gaagccaggt ggatgtgtgt ggtggctcca gtgggtgttt 
2101 ctactctgcc agtgagaggc agccccctag aaactcttca ggcgtaatgg aaaatcagct 
2161 caaatgagat caggcccccc cagggtccac ccacagagca ctacagagcc tctgaaagac 
2221 catagcacca agcgagcccc ttcagattcc cccactgtcc atcggaagat gctccagagt 
2281 ggctagaggg catctaaggg ctccagcatg gcatatccat gcccacggtg ctgtgtccat 
2341 gatctgagtg atagctgcac tgctgcctgg gattgcagct gaggtgggag tggagaatgg 
2401 ttcccaggaa gacagttcca cctctaaggt ccgaaaatgt tccctttacc ctggagtggg 
2461 agtgaggggt catacaccaa aggtattttc cctcaccagt ctaggcatga ctggcttctg 



// 



2521 aaaaattcca gcacacctcc tcgaacctca ttgtcagcag agagggccca tctgttgtct 

2581 gtaacatgcc tttcacatgt ccaccttctt gccatgttcc agctgctctc ccaacctgga 

2641 aggccgtctc cccttagcca agtcctcctc aggcttggag aacttcctca gcgtcacctc 

2701 cttcattgag ccttctctga tcactccatc cctctcctac ccctccctcc cccaaccctc 

2761 aatgtataaa ttgcttcttg atgcttagca ttcacaattt ttgattgatc gttatttgtg 

2821 tgtgtgtgtc cgatctcaca agtatattgt aaacccttcg gtgggtgggg gccatatcct 

2881 agacctctct gtatccccca gactatctgt aacagtgcca ggcacacagt aggtgatcaa 
2941 taaacacttg ttgattgag 
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source 



BE513091 945 bp mRNA linear EST 07-AUG-2000 

601171545F1 NIH_MGC_15 Homo sapiens cDNA clone IMAGE : 3545171 5', 
mRNA sequence. 
BE513091 

BE513091.1 GI : 9720302 
EST. 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 945) 

NIH-MGC http://mgc.nci.nih.gov/. 

National Institutes of Health, Mammalian Gene Collection (MGC) 

Unpublished (1999) 

Contact: Robert Strausberg, Ph.D. 

Email : cgapbs-r@mail .nih.gov 

Tissue Procurement : ATCC 
cDNA Library Preparation: Ling Hong/Rubin Laboratory 
cDNA Library Arrayed by: The I.M.A.G.E. Consortium (LLNL) 
DNA Sequencing by: NIH Intramural Sequencing Center 
Clone distribution: MGC clone distribution information can be 

found through the I.M.A.G.E. Consortium/ LLNL at: image.llnl.gov 

Plate: LLCM241 row: i column: 12 

High quality sequence start: 36 

High quality sequence stop: 782. 
Location/Qualifiers 
1. .945 

/organism="Homo sapiens" 
/mol__type= " mRNA" 
/db_xref ="taxon: 9606" 
/clone= " IMAGE : 354 5171 " 

/tissue_type="adenocarcinoma cell line" 
/lab_host="DH10B (phage-resistant) " 
/clone_lib="NIH_MGC_15" 

/note="Organ: colon; Vector: pOTB7; Site_l: Xhol; Site_2 : 
EcoRI; cDNA made by oligo-dT priming. Directionally 
cloned into EcoRl/XhoI sites using the following 5 ! 
adaptor: GGCACGAG (G) . Size-selected >500bp for average 
insert size 1.8kb. Library constructed by Ling Hong in 
the laboratory of Gerald M. Rubin (University of 
California, Berkeley) using ZAP-cDNA synthesis kit 
(Stratagene) and Superscript II RT (Life Technologies)" 



ORIGIN 



Query Match 46.8%; Score 689.8; DB 10; Length 945; 

Best Local Similarity 94.1%; Pred. No. 7.9e-190; 
Matches 761; Conservative 0; Mismatches 42; Indels 



6 ; Gaps 



4; 



Qy 

Db 



1 GAGAGAAGTCAG CCTGG CAGAGAGACTCTGAAATGAGGGATTAGAGGTGTT CAAGGAG CA 60 

I II I II II II Mil II 1 1 1 M I II Mill II II MM 1 1 II II 1 1 M 1 1 1 1 1 1 M I II II 

66 GAGAGAAGTCAG CCTGG CAGAGAGACT CTGAAATGAGGGATTAGAGGTGTT CAAGGAG CA 125 



Qy 

Db 



61 AGAG CTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACG CTTCTACTGAGAGGTCTGC 120 

1 1 1 1 1 1 1 1 1 1 M 1 1 II 1 1 1 1 1 1 1 M I M I M 1 1 M 1 1 1 1 1 [ I M 1 

126 AGAG CTTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACG CTTCTACTGAGAGGTCTGC 185 



Qy 



121 CA.TGGCCTCTCTTGGCCTCCAACTTGTGGGCTACATCCT^ 180 



Ml Ml II I Ml I II 1 1 1 1 1 II I II 1 1 III 1 1 Mill II II 1 1 1 1 1 1 III Mill 1 1 

186 CATGGCCTCTCTTGGCCTCCAACTTG 245 
181 CACACTGKjITGCCATGCT 240 

II I Ml M I M 1 1 M 1 1 1 1 M 1 1 1 1 II Ml 1 1 1 II M II 1 1 1 II 1 1 1 II Ml I II I II I ! 

246 CACACTGK3TTGCCATGCTGCTCCC 305 
241 TGTGACAGCAGTTGGCTTCTCCAAGX^ 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 

306 TGTGACAGCAGTTGGCTTCTC 365 
301 CATCACCC^GTGTGACATCTATAGCACCCT^ 360 

II 1 1 II 1 1 1 1 1 Ml I Ml II II I M 1 1 1 1 1 1 1 1 1 1 II 1 1 MM 1 1 1 1 1 1 M I Ml III II 

366 CATCACCCAGTGTGACATCTATAGCACCCTTCrnjGGCCTC 425 
361 CCAGGCCATGATGGTGACATCCAGTGCAATCTCCTC^ 420 

M I Ml 1 1 1 M I MMI I M MM I M Ml 1 1 1 M M 1 1 M 1 1 M II II 1 1 M M Ml M 

426 CCAGGCCATGATGGTGACATCCAGTC 485 
421 GGGCATGAGATGCACAGTCTTCTCCCAGGAATCCO^ 480 

I i 1 1 1 i 1 1 1 ! 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 i 1 1 1 Mill III Ml MM I 

486 GGGCATGAGATGCACAGTCTTCTC 545 

481 AGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTGG 54 0 

I II II I II I I II II I II II II I II II II II I II I I I I II I II II II II I II I I I II II I 

546 AGGTGGAGTC-TTTTCATCCTTGGAGGCCTCCTGGGATTC^TTCCTGTTGCCTGGAATCT 604 

541 TCATGGGATCCTACGGGACTTCTA -G 598 

MM II I M 1 1 1 1 1 1 II M 1 1 1 1 II 1 1 1 1 Ml I II 1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 M 

605 TCATGGGATCCTACGGGACTTCTACT 664 

599 ATTGGAGAGGCTCTTTACTTGGGCATTATTTCTTCCCTGTTCTCCCTGATAGCTGGAATC 658 

I I MIIIMMIIIMIMM I II I II I I I I I I I I I I I I I I I I I I 
665 TTGGCAGAGGCTCTTTACTTGGGCT 724 

659 A - TCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACXIATC 717 

I MIMMMMMMMMMMMMMIMM 1 1 1 i I M I M 1 1 1 1 1 M I M I If 

725 ATTCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGGTCCAACTACTACGATGCCTACCA 784 
718 AGCCCAACCTCTTGCCACAAGGAGCTCTC 777 

III III MMMIIMI III II I MM II III llllll 

785 AGK3CGCCACCTCTTGCCACAGKIAGCTCTCC 844 

778 TGAGTTCAATTCCTACAGCCTGACAGGGT 806 

I I I I I I I I I I I I I I I I I I I I I I I I 
845 T- -GGTTCATTCCTACAGCCTGACAGGGT 871 



> 



ID AAB88342 standard; protein; 230 AA. 

DT 23-MAY-2001 (first entry) 

DE Human membrane or secretory protein clone PSEC0059. 
XX 

KW Human; secretory protein; membrane protein; vaccine; gene therapy; 

KW rheumatoid arthritis; diabetes, 

XX 

OS Homo sapiens. 
XX 

PN EP1067182-A2. 
XX 

PD 10-JAN-2001. 
XX 

PF 07-JUL-2000; 2000EP-00114090 . 
XX 

PR 08-JUL-1999; 99JP-00194179 . 

PR ll-JAN-2000; 2000JP-00118775 . 

PR 02-MAY-2000; 2000 JP-00183766 . 
XX 

PA (HELI-) HELIX RES INST. 
XX 

PI Ota T, Isogai T, Nishikawa T, Kawai Y, Sugiyama T, Hayashi K; 
XX 

DR WPI; 2001-093989/11. 

DR N-PSDB; AAF93769. 
XX 

PT Nucleic acids encoding secretory proteins /membrane proteins, useful in 

PT gene therapy or as candidate target molecules in drug development. 
XX 

PS Claim 1; SEQ ID NO 52; 609pp + Sequence Listing; English. 
XX 

CC This invention relates to nucleic acid sequences AAF93744 - AAF93916 

CC which encode human secretory or membrane proteins represented by AAB88317 

CC - AAB88419. Included in the invention are primers AAF93917 - AAF94295 and 

CC AAF62232 - AAF62235 which are used to isolate the cDNA sequences of the 

CC invention. The invention also includes methods for the production of 

CC antibodies directed against the proteins, and cDNA sequences, which can 

CC be used in vaccines. The polynucleotide sequences can be used in gene 

CC therapy. The polynucleotide sequences and the proteins they encode may be 

CC used in the prevention, treatment and diagnosis of diseases associated 

CC with inappropriate secretory protein/membrane protein expression. The 

CC nucleic acids and complementary sequences may also be used as DNA probes 

CC in diagnostic assays (e.g. polymerase chain reactions (PCR) ) to detect 

CC and quantitate the presence of similar nucleic acid sequences in samples. 

CC They may also be used to study the expression and function of secretory 

CC proteins /membrane polypeptides and their role in metabolism. The 

CC polypeptides may be used as antigens in the production of antibodies 

CC against them and in assays to identify modulators (agonists and 

CC antagonists) of expression and activity. The antibodies and antagonists 

CC may also be used as therapeutic agents to down regulate expression and 

CC activity. The antibodies may also be used as diagnostic agents for 

CC detecting the presence of the polypeptides in samples (e.g. by enzyme 

CC linked immunosorbant assay (ELISA) . Examples of diseases which may be 

CC treated include rheumatoid arthritis and diabetes 
XX 

SQ Sequence 230 AA; 



*• 



Query Match 100.0%; Score 1174; DB 4; Length 230; 

Best Local Similarity 100.0%; Pred. No. 7.3e-117; 

Matches 230; Conservative 0; Mismatches 0; Indels 0; Gaps 0 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy ' 


121 


Db 


121 


Qy 


181 


Db 


181 



II 



II 



II 



III 



II 



MINIM 
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II 



MIMMIMIMI 



RESULT 10 
ABP6I7S91 
ID 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
PR 
PR 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
DR 
XX 
PT 
PT 
PT 



rotein; 230 AA. 



related polypeptide SEQ ID 
Lc ,L„, 9 e„e 

li 




lault FA, Meagher MJ, 
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linear PAT 30-MAY-2001 



CDS 



AX136129 2863 bp DNA 

Sequence 51 from Patent EP1067182. 
AX136129 

AX136129 . 1 GI : 14272537 



Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 

Ota,T., Isogai,T., Nishikawa,T. , Kawai,Y., Sugiyama,T. and 
Hayashi, K. 

Secretory protein or membrane protein 
Patent: EP 1067182-A 51 10-JAN-2001; 
Helix Research Institute (JP) 

Locat ion/Qual i f iers 

1. .2863 

/organism="Homo sapiens" 
/mol_type="unas signed DNA" 
/db_xref ="taxon: 9606" 
231. .923 

/note= "unnamed protein product" 

/codon_start=l 

/protein_id= " CAC3 97 07 . 1 " 

/db_xref ="GI : 14272538" 

/ db_xr e f = " REMTREMBL : CAC3 9707" 

/ 1 rans 1 a t i on= " MAS LGLQL VG Y I LGLLGLLGTLVAMLLPSWKTSS YVGAS I VTAV 
GFSKGLWMECATHSTGI TQCDI YSTLLGLPADI QAAQAMMVTSSAI SSLACI I SWGM 
RCTVFCQESRAKDRVAVAGGVFFILGGLLGFIPVAWNLHGILRDFYSPLVPDSMKFEI 
GEALYLGI ISSLFSLIAGI ILCFSCSSQRNRSNYYDAYQAQPLATRSSPRPGQPPKVK 
SEFNSYSLTGYV" 



ORIGIN 



Query Match 99.7%; Score 1470.2; DB 6; Length 2863; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 1472; Conservative 0; Mismatches 3; Indels 0; Gaps 0; 

1 GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCA 60 



Qy 


i 


Db 


110 


Qy 


61 


Db 


170 


Qy 


121 


Db 


230 


Qy 


181 


Db 


290 


Qy 


241 


Db 


350 



II 

110 GAGAGAAGTCAGCCTGGCAGAGAGACTCTGAAATGAGGGATTAGAGGTGTTCAAGGAGCA 169 



II 



1 1 1 1 M I 1:1 I hi I 



I 



II 



llllllll 



181 CACACTGGTTGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCAT 240 

IIIIIIIIIIIIIIIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

290 CACACTGGTTGCCATGCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCAT 349 

TGACAGCAGTTGGCTTCTCCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGG 300 
II 



Qy 


301 


CATCACCCAGTGTGACATCTATAGC^ 

i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CATCACCCAGTGTGACATCTATAGCACCCTTCTGGGCCTC 


360 


Db 


410 


469 


Qy 


361 


CCAGGCCATGATGGTGACATCC^GTC 

i i i i i i i i i i i i i i i i i i > t i i i i i i i i i i i i i i i i i i i i i i t i i t < i i i i i i i i i i i i i 


420 


Db 


470 


III MINI II MM II III III II II II II MM II II II II II II II II II II III II 

CCAGGCCATGATGGTGACATCCAGTGCAATCT 


529 


Qy 


421 


GGGCATGAGATGCACAGTCTTC^ 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i * i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGGCATGAGATGCACAGTCTTCTGC^GGAATCCQ^GCCAAAGACAGAGTGGCGGTAGC 

• 


480 


Db 


530 


589 


Qy 


481 


AGGTGGAGTCTTTTTCATCCTTGGA 

M 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 M II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 II 1 1 1 

AGGTGGAGTCTTTTTCATCCTTGGAGGCCTCCTC 


540 


Db 


590 


649 


Qy 

Db 


541 
650 


TCATGGGATCCTACXjGGACTTCTACTC 

i i i i i i i i i t i i i i i i i i i i i t i i < i i i i i i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i 
II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II II II 1 1 1 1 1 1 II 1 1 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 

TCATGGGATCCTACGGGACTTCT^ 


600 
709 


Qy 


601 


TGGAGAGGCTCTTTACTTGGGCATTATT^ 

i i i i i i i i i i i f i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i t i t i i i i i i i t i i i i i i i 


660 


Db 


710 


1 1 1 II 1 1 II II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 II 1 1 1 1 1 1 1 II II 

TGGAGAGGCTCTTTACTTGGGCATTATTTCT 


769 


Qy 


661 


CCTCTGCTTTTCCTGCT(^TCCCAGAGAAATCGCTCCAACTACTAOTATGCCTACCAAGC 

iiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiiiiiiiitiiiiiiiiiiiiii 


720 


Db 


770 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTACCAAGC 


829 


Qy 

Db 


721 
830 


CCAACCTCTTGCCACAAGK1AGCTCTC 

1 1 II 1 1 1 1 1 1 M 1 1 II 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 M II II II M M M 

CCAACCTCTTGCCACAAGGAGCTCTCCA^ 


780 
889 


Qy 


781 


GTTC^TTCCTAC^GCCTGAC^GGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGT 
iitiiiiiiiiiiiiiiitiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiiitiiiiii 


840 


Db 


890 


MIMIMMMMMMMMIMMMMIIIMMMMMMIMMMMIIMI 

GTTCAATTCCTACAGCCTGACAGGGTATGTGTGAAGAACCAGGGGCCAGAGCTGGGGGGT 


949 


Qy 


841 


GGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCA 

lllllll llllllllllll IAII lllllllllllllllltlllllllllltllillllll 

1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGCTGGGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACC 


900 


Db 


950 


1009 


Qy 


901 


ACTGGATCGTGTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCA 
i i i t i i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i * i i i i i i i i i i i 


960 


Db 


1010 


MMIIIMilMMMMMMIIIMMMMMMMIMMIMIMMMIMM 

ACTGGATCGTGTCAGAAGGTGCTGCTGAGGATA 


1069 


Qy 


961 


AAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAA 
i i i i i i i i i i i i i i i i i i i < i i i i i i i i i i i i i i i t i i t i < i i i i i i i i > i i i i i i i i < i 


1020 


Db 


1070 


II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 M 1 1 1 1 M 1 1 1 1 M 1 1 1 1 II 

AAGGCAGAAATGGGGGCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCA 


1129 


Qy 

Db 


1021 
1130 


TGCCAGCCTTTCTGTTTTCCTC^CCTTGCTGCTCCCCTGCCCrAAGTCCCC^CCCT(^A 
i i i i i i i i i i t i i i i i i i i i i i i i i i i i i * t i i i i i i i i i i i i i i i i i i i * i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 

TGCCAGCCTTTCTGTTTTCCTCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAA 


1080 
1189 


Qy 


1081 


CTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCT 

1 II llllll II III III II MM MM II II II II llllllllllll II II II II Mill 

CTTGAAACCCCATTCCCTTAAGCCAGGACTCAGAGGATCCCTTTG 


1140 


Db 


1190 


1249 


Qy 


1141 


GGGACTCCATCCCCAAA.CCCACTAATCACATCCCACTGACTGACCCTCTC 


1200 



Db 


1250 


i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

GGGACTCCATCCCO^CCCACTAATCACATCCCACr^CTGACCCTCTGTGATCAAAGA 


1309 


Qy 


1201 


• 

CCCTCTCTCTGGCTGAGGTTGGCT 

i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

CCCTCTCTCTGGCTGAGGTTGGCTCTTAGCTCA 


1260 


Db 


1310 


1369 


Qy 
Db 


1261 
1370 


GGCTTTTGTGGGCATTGCTCTAACCTACTTCT 

i i i i i i i i t i i i i i i i i i i i * i i i i i i i i * i i i * i i i i i i i i i i i * i i i i i i i i t i i • i i 

IIIIIIIMIIIIIIMIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIIII 

GG CTTTTGTGGGCATTGCTCTAACCTACTTCTCA^ 


1320 
1429 


Qy 


1321 


CCCTGGAACCTCCATCCCACTCTTGTTATGACTCCA 

llllllllllllllllllllll IllliihlUf! II III III II IIIIMIIMIIII 

CCCTGGAACCTCCATCCCACTCCTGTTATGACTC 


1380 


Db 


1430 


1489 


Ov 


1381 


GAA^TGAAATAAAAfTTVTCCTACT^ 


1 A A fl 


Db 


1490 


IIIIIMIIIMIIIIIMIIMMMIIIIIIMMIIIIIIIIIIIIMIIIIIIIII 

GAACTGAAATAAAACCATCCTACGGTATCCAGGGAA 


1549 


Qy 


1441 


GGACAGGAAGGCAGCCTGGGACATTTAAAAAAATA 1475 




Db 


1550 


llllllllllllllllllllllllllllllll II 
GGACAGGAAGG CAG C CTGGGACATTTAAAAAATTA 1584 





» 



